SAFEGUI: resampling-based tests of categorical significance in gene expression data made easy

نویسندگان

  • Daniel M. Gatti
  • Myroslav Sypa
  • Ivan Rusyn
  • Fred A. Wright
  • William T. Barry
چکیده

SUMMARY A large number of websites and applications perform significance testing for gene categories/pathways in microarray data. Many of these packages fail to account for expression correlation between transcripts, with a resultant inflation in Type I error. Array permutation and other resampling-based approaches have been proposed as solutions to this problem. SAFEGUI provides a user-friendly graphical interface for the assessment of categorical significance in microarray studies, while properly accounting for the effects of correlations among genes. SAFEGUI incorporates both permutation and more recently proposed bootstrap algorithms that are demonstrated to be more powerful in detecting differential expression across categories of genes. AVAILABILITY http://cebc.unc.edu/software/.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Testing the Statistical Significance of Linear Programming Estimators

Linear programming-based estimation procedures are used in a variety of arenas. Two notable areas are multi-attribute utility models (LINMAP) and production frontiers (Data Envelopment Analysis, DEA). Both LINMAP and DEA have theoretical and managerial advantages. For example, LINMAP treats ordinal scaled preference data as such in uncovering individual level attribute weights, while regression...

متن کامل

Methods for significance testing of categorical covariates in logistic regression models after multiple imputation: power and applicability analysis

BACKGROUND Multiple imputation is a recommended method to handle missing data. For significance testing after multiple imputation, Rubin's Rules (RR) are easily applied to pool parameter estimates. In a logistic regression model, to consider whether a categorical covariate with more than two levels significantly contributes to the model, different methods are available. For example pooling chi-...

متن کامل

Multiple Testing for Gene Expression Data: An Investigation of Null Distributions with Consequences for the Permutation Test

Gene expression studies produce data with which inferences can be made for thousands of genes simultaneously, allowing researchers to answer questions such as “Which genes are significantly differently expressed between two (or more) conditions?” or “Which genes have a significant association with an outcome or covariate?”. In order to make statements about the statistical significance of thous...

متن کامل

Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach

Microarrays are widely used for examining differential gene expression, identifying single nucleotide polymorphisms, and detecting methylation loci. Multiple testing methods in microarray data analysis aim at controlling both Type I and Type II error rates; however, real microarray data do not always fit their distribution assumptions. Smyth's ubiquitous parametric method, for example, inadequa...

متن کامل

Advanced significance analysis of microarray data based on weighted resampling: a comparative study and application to gene deletions in Mycobacterium bovis

MOTIVATION When analyzing microarray data, non-biological variation introduces uncertainty in the analysis and interpretation. In this paper we focus on the validation of significant differences in gene expression levels, or normalized channel intensity levels with respect to different experimental conditions and with replicated measurements. A myriad of methods have been proposed to study diff...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 25  شماره 

صفحات  -

تاریخ انتشار 2009